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OO  ON  O  O 


INTRODUCTION 


t. 


Cell-extracellular  matrix  (ECM)  interaction  has  a  profound  impact  on 
cell  structure  and  function.  There  is  overwhelming  evidence  that  the 
ECM  influences  mammary  epithelial  cell  differentiation.  Cell-ECM 
interaction  is  mediated  through  a  variety  of  cell  surface  receptors 
including  integrins.  The  a2(31  integrin,  a  collagen/laminin  receptor, 
is  expressed  in  mammary  epithelial  cells  and  plays  a  critical  role  in 
normal  mammary  cell  differentiation  as  well  as  in  the  pathogenesis 
of  breast  cancer.  Modulation  of  a2pl-ECM  interaction  could  affect 
mammary  cell  differentiation.  Elucidation  of  a2(3l -ligand  binding 
mechanisms  may  lead  to  the  development  of  therapeutic 
interventions  against  breast  cancer. 

The  ligand  binding  site(s)  in  a2pi  has  been  localized  in  the  I  (A) 
domain  of  the  a2  subunit.  Although  several  amino  acid  residues  that 
constitute  the  metal  ion-dependent  adhesion  site  (MIDAS)  are  critical 
for  ligand  binding,  the  specific  contact  site  for  each  ligand  is  not  well 
characterized.  I  focused  my  effort  on  identifying  the  specific  amino 
acid  residues  that  make  direct  contact  with  collagen.  I  found  that 
multiple  discontinuous  amino  acid  residues  surrounding  the  MIDAS 
motif  make  direct  contact  with  collagen.  Based  upon  the  crystal 
structure  of  the  a2  I  domain,  I  created  a  docking  model  of  a2  I 
domain  and  collagen  triple  helical  peptide.  Using  the  same  strategy, 

I  will  be  able  to  identify  residues  that  make  contact  with 
recombinant  laminin  al  chain.  Once  the  specific  mutations  that  block 
either  collagen  or  laminin  are  determined,  I  will  express  these 
mutations  in  a  breast  cancer  cell  line  and  examine  their  effect  on 
differentiation.  The  information  obtained  in  this  study  may  form  the 
basis  for  the  development  of  therapeutic  interventions  against  breast 
cancer. 
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BODY  -  Annual  Summary 


Discontinuous  multiple  amino  acid  residues  in  the  I  domain  are 
required  for  collagen  binding 

The  MIDAS  face,  the  putative  ligand  binding  face,  of  the  a2  I  domain 
is  composed  of  four  discontinuous  loop  structures.  The  PA-al,  a3-a4, 
and  pD-a5  loops  contain  amino  acid  residues  that  compose  the 
MIDAS  motif,  which  is  critical  for  divalent  cation  coordination.  These 
amino  acid  residues  are  conserved  relatively  very  well  among 
different  I  domains.  On  the  contrary,  the  amino  acid  sequences  of 
the  PE-oc6  loop  are  variable.  Notably,  there  are  five  amino  acid 
insertions  within  this  loop  in  collagen-binding  al,  a2,  and  alO 
integrin  I  domains  (Fig.  1).  The  crystal  structure  of  a2  I  domain 
reveals  that  these  inserted  residues  constitute  an  extra  C-helix 
within  the  pE-a6  loop.  This  C-helix  creates  a  groove  on  top  of  the 
MIDAS  face  in  which  a  collagen  triple  helix  is  predicted  to  dock. 
Therefore  the  C-helix  is  predicted  to  be  the  major  determinant  of 
collagen  binding.  I  reported  in  the  previous  annual  report  that  the 
collagen  binding  sites  in  a2pi  are  localized  in  the  three  discontinuous 
loops  (PA-al  loop,  a3-a4  loop,  and  pD-a5  loop)  within  the  I  domain 
of  the  a2  subunit  based  on  the  results  from  a2/aL  loop-swapping 
mutagenesis.  To  further  characterize  the  collagen-contact  face,  I 
introduced  multiple  point  mutations  into  the  residues  within  the  PA- 
al,  a3-a4,  pD-a5,  and  pE-a6  loops.  I  stably  expressed  mutant  a2  in 
CHO  cells  and  examined  their  binding  to  immobilized  collagen  type  I. 
Consistent  with  the  results  from  loop-swapping  mutagenesis,  I  found 
that  Ser-153,  Ile-156,  and  Tyr-157  in  pA-ccl;  Gln-215  and  Gly-218  in 
a3-a4;  and  Gly-255  in  pD-a5,  in  addition  to  the  previously  reported 
Asp- 151,  Thr-221,  and  Asp-254,  are  critical  for  collagen  binding.  In 
addition,  mutation  of  Asn-154,  Ser-155,  Asp-219,  and  His-258  had  a 
mild  blocking  effect  on  collagen  binding  when  collagen  type  I  was 
immobilized  at  2  pg/ml(Fig.  2).  Unexpectedly,  mutations  in  the 
amino  acid  residues  in  the  PE- a 6  loop  did  not  affect  collagen  binding 
at  all.  It  is  possible  that  several  amino  acid  residues  in  the  PE-a6 
loop  have  to  be  mutated  at  the  same  time  to  detect  their  effect  on 
collagen  binding.  To  rule  out  this  possibility,  I  created  a  deletion 
mutant  that  lacks  most  of  the  residues  in  the  pE-a6  loop,  including 
the  entire  C-helix.  I  generated  a  clonal  CHO  cell  line  expressing  the 
mutant  a2  and  examined  its  binding  to  plastic  plates  that  had  been 
coated  with  collagen  type  I  at  various  concentrations.  The  del  aC 
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mutant  showed  binding  to  collagen  type  I  comparable  with  that  of 
wild-type  at  all  collagen  immobilizing  concentrations  (Fig.  3).  These 
results  suggests  that  the  del  aC  mutant  has  a  collagen-binding 
affinity  comparable  to  that  of  wild-type  a2;  therefore  the  unique  C- 
helix  does  not  contain  an  energetically  important  collagen  contact 
site.  Instead,  the  collagen  contact  sites  are  localized  in  the  relatively 
conserved  sequences  surrounding  the  MIDAS  motif. 

a2  I  domain/collagen  binding  model 
In  collaboration  with  Dr.  Liddington,  we  created  an  a2  I 
domain/collagen  docking  model  based  on  the  current  mutagenesis 
data.  In  the  crystal  structure,  Asp-151,  Ser-153,  Ser-155,  Thr-221, 
and  Asp-254  are  involved  in  the  coordination  of  the  cation.  lie- 156 
is  totally  buried  in  the  molecule.  Mutations  of  Gln-215,  Gly-218,  and 
Gly-255  are  either  buried  or  likely  to  disrupt  the  MIDAS  motif.  In 
contrast,  Asn-154,  Tyr-156,  Asp-219,  and  His-258  are  totally 
exposed  on  the  surface  of  the  molecule,  suggesting  that  these 
residues  make  direct  contact  with  collagen.  Recently,  a  short 
synthetic  triple-helical  peptide,  representing  residues  502-516  of  the 
collagen  type  I  al  chain,  has  been  shown  to  support  purified  a 2 (31 
and  recombinant  a2  I-domain. binding.  The  Glu  and  Arg  residues  in 
the  GER  triplet  were  found  to  be  essential  for  recognition  by  the  a2  I- 
domain.  We  first  attached  the  glutamate  side  chain  of  the  GER  motif 
to  the  MIDAS  Mg2+  ion,  then  rotated  the  collagen  to  minimize  the 
distance  between  the  collagen  and  those  surface-exposed  residues 
implicated  in  collagen  binding  (Asn-154,  Tyr-157,  Asp-219  and  His- 
258)  while  maintaining  the  2  A  bond  between  the  glutamate  oxygen 
and  the  Mg2+  ion  and  avoiding  other  close  contact  with  the  protein. 

In  the  current  model,  Asn-154,  Tyr-157,  Asp-219,  and  His-258  make 
direct  contact  with  collagen,  and  the  arginine  from  the  GER  motif 
makes  a  salt  bridge  to  Glu-256  (Fig.  4).  In  addition,  Tyr-285  from 
the  C-helix  also  makes  direct  contact  with  collagen.  However,  current 
mutagenesis  data  suggest  its  not  energetically  important. 

Recommendations  in  Relation  to  the  Statement  of  Work 

Thus  far  I  have  focused  my  effort  on  identifying  the  collagen  contact 

site  in  a2|31.  Using  the  same  strategy,  I  plan  to  identify  the  laminin- 

1  contact  site  using  a  recombinant  laminin  al  chain.  The  collagen 

and/or  laminin-binding-defective  mutants  produced  will  be  useful 

for  elucidating  the  role  of  cell-ECM  interactions  in  breast  cancer  cell 

differentiation. 
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Fig.  1  Alignment  of  l-domains.  The  loops  comprising  the  MIDAS  motif  (pA-al ,  a3-a4, 
PD-a5)  and  the  loop  which  includes  aC  helix  (PE-a6)  are  outlined  in  boxes.  The  asterics 
indicate  aminmo  acid  residues  critical  for  a2pi -collagen  interaction. 
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Fig.  3  The  effect  of  del  aC  mutation  on  a2(31  -collagen  type  I  interaction. 

Cells  were  incubated  for  one  hour  in  wells  which  has  been  precoated  with 
different  concentration  of  collagen  type  I.  After  washing,  bound  cells  were 
quantitated  by  endogenous  phosphatase  assay. 


Fig.  4  Docking  model  of  the  a2  l-domain  and  collagen 

All-atom  representation  of  the  a2  I  domain,  viewed  looking  down  onto 
the  MIDAS  face.  The  Mg2+  ion  is  labeled  at  the  center  of  the  molecule. 

The  collagen  molecule  is  shown  as  a  triple  helical  coil  drawn  through  the 

Ca  positions.  Asn-154,  Tyr-157,  Asp-219,  His-258  and  Tyr-285  are  labeled. 
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1)  Key  research  accomplishments 

a)  Identification  of  the  collagen  contact  site  in  the  I  domain  of 
a  2  (31  integrin. 

b)  Generation  of  collagen/a 2  I  domain  docking  model. 

2)  Reportable  outcomes 

a)  Manuscripts:  Tetsuji  Kamata,  Robert  C.  Liddington,  and 

Yoshikazu  Takada.  Interaction  between  collagen  and  the  a2 
I-domain  of  integrin  a2(31:  Critical  role  of  conserved  residues 
in  the  MIDAS  region.  J.  Biol  Chem.,  in  press. 

3)  See  attached  copy  of  the  manuscript  cited  in  2a. 


To  appear  in  J.  Biol*  Cheuu 
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Title:  Interaction  between  collagen  and  the  a2  I-domain  of  integrin  a2(Jl: 
Critical  role  of  conserved  residues  in  the  MEDAS  region. 

Authors:  Tetsuji  Kamata,  Robert  C.  Liddington**,  and  Yoshikazu  Takada* 
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ABSTRACT 


< 


A  docking  model  of  the  a2  I-domain  and  collagen  has  been  proposed  based 
on  their  crystal  structures  (Emsley,  J.,  King,  S.,  Bergelson,  J.  and  Liddington, 
R.  C.,  J.  Biol.  Chem.,  272,  28512-28517,  1997).  In  this  model,  several  amino 
acid  residues  in  the  I-domain  make  direct  contact  with  collagen  (N154, 
D219,  L220,  E256,  H258,  Y285,  N289,  L291,  N295,  and  K298),  and  the 
protruding  C-helix  of  a2  (residues  284-288)  determines  ligand  specificity. 
Since  most  of  the  proposed  critical  residues  are  not  conserved,  different  I- 
domains  are  predicted  to  bind  to  collagen  differently.  We  found  that 
deleting  the  entire  C-helix  or  mutating  the  predicted  critical  residues  had 
no  effect  on  collagen  binding  to  whole  a2f$l,  with  the  exception  that 
mutating  N154,  D219,  and  H258  had  moderate  effect.  We  performed 
further  studies  and  found  that  mutating  the  conserved  surface-exposed 
residues  in  the  MIDAS  (Y157  and  Q215)  significantly  blocks  collagen 
binding.  We  have  revised  the  docking  model  based  on  the  mutagenesis 
data.  In  the  revised  model,  conserved  Y157  makes  contact  with  collagen  in 
addition  to  the  previously  proposed  N154,  D219,  H258,  and  Y285  residues. 
These  results  suggest  that  the  collagen-binding-I-domains  (e.g.,  al,  a2,  and 
a  10)  bind  to  collagen  in  a  similar  fashion. 
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INTRODUCTION 

Several  integrin  a  chains  (al,  a2,  alO,  aL,  aM,  aX,  aD,  and  aE)  have 
inserted  I-  or  A-  domains  of  about  200  amino  acid  residues  (1-11)). 
Integrins  alpl  (1),  a2pl  (reviewed  in  (12)),  and  alOpl  (4)  have  been  shown 
to  bind  to  collagen  and/or  laminin.  Several  function-blocking  antibodies 
map  to  the  I-domains  of  a2pl  (13)  and  alpl  (14).  The  recombinant  a2  I- 
domain  fragment  binds  to  collagen  (15,16),  and  the  recombinant  al  I- 
domain  fragment  binds  to  collagen  and  laminin  (17).  Conserved  Asp  and 
Thr  residues  in  the  a2  I-domains  (D151,  T221,  and  D254)  are  critical  for 
collagen  binding  (15).  These  lines  of  evidence  suggest  that  the  I-domain  is 
critically  involved  in  collagen  binding. 

The  crystal  structures  of  the  I-domains  of  the  integrin  aM  (18),  aL 
(19),  and  a2  (20)  subunits,  and  the  Al  (21,22)  and  A3  (23,24)  domains  of 
von  Willebrand  factor  (vWf)  have  been  published.  This  domain  adopts  a 
classic  "Rossmann"  fold  and  consists  of  a  hydrophobic  P -sheet  in  the  middle 
and  amphipathic  a -helices  on  both  sides.  Interestingly,  the  integrin  I- 

domain  contains  a  Mg^+/Mn^+  coordination  site  at  its  surface,  which  is  not 
present  in  proteins  with  similar  structures  (e.g.,  the  NAD  binding  domain  of 
lactate  dehydrogenase)  or  the  vWf  Al  and  A3  domains  (21-24).  The  Asp 
and  Thr  residues  in  a  2  that  have  been  shown  to  be  critical  for  ligand 
binding  are  involved  in  the  coordination  of  a  divalent  cation  in  the  crystal 
structure  (20).  The  a2  I-domain  has  a  unique  helix  (the  C-helix)  protruding 
from  the  metal  ion  dependent  adhesion  site  (MIDAS)  that  creates  a  groove 
centered  on  the  magnesium  ion.  Emsley  et  al.  proposed  a  model  in  which  a 
collagen  triple  helix  fits  into  die  groove,  and  a  Glu  side  chain  from  collagen 
coordinates  the  metal  ion.  In  this  model,  the  C-helix  is  a  major  determinant 
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for  collagen  binding.  It  was  as  predicted  that  the  following  I-domain 
residues  make  direct  contact  with  collagen:  N154  (the  J3A-al  turn),  D219 
and  L220  (the  a3-oc4  turn),  E256  and  H258  (the  pD-a5  turn),  and  Y285, 
N289,  L291,  N295  and  K298  (the  C-helix,  a6  and  C-a6  turn).  However, 
these  residues  are  not  well  conserved  among  collagen-binding  I-domains 
(e.g.,  al,  a2,  and  a  10),  suggesting  that  different  I-domains  interact  with 
collagen  in  different  manners.  Here  we  show  that  mutation  of  the  residues 
proposed  to  be  critical  for  ligand  binding  or  deletion  of  the  entire  C-helix 
did  not  significantly  affect  collagen  binding  to  whole  a2j3l  expressed  on 
mammalian  cells  except  for  N154,  D219,  and  H258.  In  contrast,  mutating 
several  conserved  MIDAS  residues  including  Y157  significantly  blocks 
collagen  binding.  We  have  revised  the  docking  model  based  on  the 
mutagenesis  data.  In  the  revised  model,  interaction  between  the  a2  I- 
domain  and  collagen  is  mediated  by  relatively  conserved  residues  in  the 
MIDAS  on  the  N-terminal  side  of  the  I-domain.  Thus,  it  is  suggested  that 
the  collagen-binding  I-domains  (e.g.,  al,  a2,  and  a  10)  bind  collagen  in  a 
similar  fashion. 
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EXPERIMENTAL  PROCEDURES 

Monoclonal  antihodies.  HAS-3  and  HAS-4  (25)  are  generous  gifts  from  F. 
Watt  (Imperial  Cancer  Research  Fund,  London,  UK.) 

Adhesion  of  CHO  cells  to  collagen. 

Wells  of  96  well  microtiter  plates  (Immulon-2,  Dynatech  Labs.,  Inc., 
Chantlly,  VA)  were  coated  with  type  I  collagen  (2  or  10  jig/ml)  at  4  °C 
overnight.  The  other  protein  binding  sites  were  blocked  by  incubating  with 
1  %  (w/v)  bovine  serum  albumin  (Calbiochem,  CA)  for  30  min  at  room 
temperature,  and  washing  three  times  with  PBS  (10  mM  phosphate,  0.15  M 
NaCl,  pH  7.4).  Cells  were  harvested  with  3.5  mM  EDTA  in  PBS  and  washed 
twice  with  Dulbecco’s  modified  Eagle  medium.  105  cells  (in  100  jil 
Dulbecco's  modified  Eagle  medium)  were  added  to  each  well  and  incubated 

for  1  h  at  37  °C.  The  wells  were  rinsed  three  times  with  PBS  to  remove 
unbound  cells.  Bound  cells  were  quantified  by  assaying  endogenous 
phosphatase  activity  (26). 

Molecular  modeling. 

A  model  of  a  collagen  triple  helix  was  constructed  from  the  crystal 
structure  [(27),  PDB  entry  lcag]  as  previously  described  (20).  Side-chains 
from  the  sequence  of  the  CB3(I)5/6  peptide  containing  the  GER  motif  (28) 
were  grafted  onto  the  collagen  in  standard  conformations  using  the 
program  TOM  (29).  The  glutamate  of  one  of  the  GER  motifs  was  attached  to 

the  Mg2+  ion  of  the  MIDAS  motif  via  one  of  its  carboxylate  oxygens  at  a 
distance  of  2.0  A.  Keeping  the  I-domain  fixed,  the  collagen  was  then 
allowed  to  rotate  around  a  fixed  point  (the  glutamate  oxygen)  to  minimize 
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the  distance  between  the  collagen  and  the  side  chains  of  those  residues 
which  showed  reduced  collagen  binding  when  mutated  and  which  were 
exposed  on  the  surface  of  the  I-domain.  Unfavorably  close  contacts  (<  2.5 
A)  between  the  collagen  and  the  I-domain  were  monitored  using  the 
program  TOM.  Since  the  triple  helical  nature  of  collagen  generates  three 
chemically  distinct  strands  even  for  a  homo-tripeptide  (which  we  call  the 
leading,  middle,  and  trailing  strands)  each  of  these  was  tested  separately. 

Other  methods. 

Swapping  mutagenesis  was  carried  out  using  the  overlap  extension 
polymerase  chain  reaction  (30).  The  positions  of  the  a2  sequences  replaced 
by  homologous  aL  sequences  are  residues  152-157,  212-219,  and  257-262 
(designated  pA-al,  a3-a4,  and  pD-a5,  respectively)(Fig.  1).  Deletion  of 
residues  284-291  (designated  aC  del)  and  point  mutations  were  created  by 
site-directed  mutagenesis  using  the  unique  site  elimination  method  with  a 
double  stranded  vector  (31).  The  presence  of  mutation  was  confirmed  by 
DNA  sequencing.  Transfection  of  cDNAs  into  CHO  cells  by  electroporation, 
selection  of  transfected  cells  with  G418,  immunoprecipitation,  and  flow 
cytometry  were  carried  out  as  previously  described  (32). 
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RESULTS  AND  DISCUSSION 

The  MIDAS  of  the  a2  I-domain  is  composed  of  four  loops  (the  pA-al, 
a3-a4,  pD-a5,  and  |IE-a6  loops).  The  conserved  residues  D151  in  the  PA-al 
loop,  T221  in  the  a3-a4  loop,  and  D254  in  the  pD-a5  loop  are  critical  for 
cation  coordination  and  ligand  binding  (13,15).  A  unique  C-helix  is  inserted 
within  the  pE-a6  loop  in  the  I-domains  of  the  collagen-binding  integrins 
al,  a2,  and  alO,  but  is  not  present  in  the  I-domains  of  aM  or  aL,  or  the  A3 
domain  of  vWf.  This  C-helix  has  been  predicted  to  be  a  major  determinant 
for  collagen  binding  (20). 

To  identify  the  residues  in  the  MIDAS  that  are  critical  for  collagen 
binding,  we  introduced  multiple  point  mutations  into  each  MIDAS  loop.  We 
also  included  amino  acid  residues  (N154,  D219,  L220,  E256,  H258,  Y285, 
N289,  L291,  and  N295)  that  have  been  predicted  to  make  direct  contact 
with  collagen  (20).  Mutant  a 2  was  transfected  into  CHO  cells  together  with 
a  neomycin-resistant  gene  and  selected  for  G-418  resistance.  Cells  stably 
expressing  the  mutant  a2  were  used  for  adhesion  assays.  Fig.  2  shows  the 
adhesion  of  the  mutants  to  collagen  type  I  expressed  as  a  percentage  of 
cells  adherent  to  collagen  per  percentage  of  human  a2  positive  cells 
(normalized  adhesion  to  collagen).  We  found  that  mutating  several 
residues  in  the  PA-al  loop  (S153  and  Y157)  and  the  a3-a4  loop  (Q215) 
blocks  collagen  binding.  In  addition,  mutation  of  N154  and  S155  in  the  PA- 
al  loop,  D219  in  die  a3-a4  loop,  and  H258  in  the  pD-a5  loop  produced  an 
inhibitory  effect  at  lower  collagen  coating  concentrations. 

We  also  swapped  the  PA-al  (residues  152-157),  a3-a4  (residues 
212-219),  and  pD-a5  loops  (residues  257-262)  with  the  corresponding 
sequences  of  aL,  which  does  not  interact  with  collagen  (Fig.  1).  These 
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swapping  mutations  did  not  change  the  conserved  residues  D151,  T221, 
and  D254,  which  are  critical  for  cation  and  collagen  binding.  Cells 
expressing  mutant  a 2  were  tested  for  their  ability  to  adhere  to  collagen. 
The  expression  of  the  a3-oc4  swapping  mutant  was  too  low  to  produce 
reliable  adhesion  data  (data  not  shown).  Other  mutants  showed  a  surface- 
expression  level  comparable  with  that  of  wild  type  and  reacted  with 
multiple  mAbs  against  a2  (Fig.  3a).  The  pA-al  swapping  mutant  showed 
collagen  binding  at  a  background-level.  Also,  the  pD-a5  swapping  mutant 
showed  significantly  reduced  collagen  binding.  These  results  are  consistent 
with  those  obtained  using  alanine-scanning  mutagenesis. 

In  contrast,  mutation  of  amino  acid  residues  in  the  pE-a6  loop, 
including  the  C-helix,  did  not  have  any  inhibitory  effect  on  collagen 
binding.  Mutation  of  Y285,  N289,  L291,  and  N295,  which  are  predicted  to 
make  direct  contact  with  collagen,  did  not  significantly  affect  collagen 
binding,  even  at  low  (2  pg/ml)  collagen  coating  concentrations  (Fig.  2).  It  is 
possible  that  single  amino  acid  substitution  may  not  be  enough  to  induce  a 
detectable  effect  on  collagen  binding.  So,  we  deleted  most  of  the  pE-a6 
loop,  including  the  entire  C-helix,  to  determine  whether  the  C-helix  is 
critical  for  ligand  specificity.  These  mutant  a2  cDNAs  were  stably 
expressed  on  CHO  cells,  and  further  cloned  to  obtain  high  expressors.  The 
aC  deletion  mutant  showed  collagen  binding  at  a  level  comparable  to  that 
of  wild  type  (Fig.3a).  Adhesion  of  the  aC  deletion  mutant  as  a  function  of 
collagen  coating  concentration  was  tested.  Adhesion  to  collagen  of  both 
wild  type  and  aC  deletion  mutant  a2pi  was  saturated  at  about  1  |xg/ml 
collagen  coating  concentration,  indicating  that  the  affinity  to  collagen  is  not 
affected  by  the  aC  deletion  (Fig. 3b). 
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These  results  suggest  that  collagen  binding  is  mediated  by  relatively 
conserved  MIDAS  residues,  which  are  located  on  the  N-terminal  side  of  the 
I-domain.  S153  and  S155  are  involved  in  metal  coordination,  and  mutating 
these  residues  would  disrupt  metal  binding  to  the  I-domain.  Q215  is  part 
of  the  MIDAS  face  and  makes  main  hydrogen  bond  to  DXSXS  loop.  It  is 
likely  that  mutating  these  residues  block  collagen  binding  by  disrupting 
metal  binding  to  the  I-domain.  N154,  D219,  and  H258  have  been  predicted 
to  make  direct  contact  with  collagen  in  the  previous  model  (20),  although 
the  effects  of  mutating  these  residues  are  moderate.  Y157  is  totally 
exposed  to  the  surface,  and  this  residue  may  make  direct  contact  with 
collagen.  Y157  has  not  been  predicted  to  make  direct  contact  with  collagen. 
Mutating  the  other  residues  that  are  predicted  to  be  critical  for  collagen 
binding  in  the  proposed  model  (L220,  E256,  Y285,  N289,  L291,  N295,  and 
K298)  has  no  significant  effect  on  collagen  binding.  Even  deletion  of  the 
entire  C-helix  did  not  significantly  affect  collagen  binding.  These  results 
support  the  role  of  the  MIDAS  motif  in  collagen  binding,  since  many  of  the 
mutants  that  affect  collagen  binding  are  predicted  to  disrupt  the  MIDAS 
motif.  However,  the  present  results  are  not  fully  consistent  with  the 
proposed  model  for  collagen/a2  I-domain  binding  (20). 

We  have  modified  the  collagen/a2  I-domain  docking  model  based  on 
the  present  mutagenesis  data  (Fig.  4).  Recently,  short  synthetic  triple¬ 
helical  peptide,  corresponding  to  residues  502-516  of  the  collagen  type  I 
al  chain,  has  been  shown  to  bind  to  purified  a2jll  and  recombinant  a2  I- 
domain  (28).  The  Glu  and  Arg  residues  in  the  GER  triplet  were  found  to  be 
essential  for  recognition  by  a2  I-domain  (28).  In  the  current  model,  we 

first  attached  the  glutamate  side  chain  of  die  GER  motif  to  the  MIDAS  Mg^+ 
ion.  We  then  rotated  the  collagen  to  minimize  the  distance  between  the 
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collagen  and  those  surface-exposed  residues  implicated  in  collagen  binding 
(H258,  Y157,  D219  and  N154)  while  maintaining  the  2  A  bond  between  the 
glutamate  oxygen  and  the  Mg2+  ion  and  avoiding  other  close  contacts  (<2.5 
A)  with  the  protein.  It  was  not  initially  possible  to  make  favorable 
hydrogen  bonds  with  all  four  side  chains  simultaneously,  so  the  side  chains 
were  allowed  to  rotate  about  their  Ca-Cp  bonds  in  order  to  make  plausible 
hydrogen  bonds  with  the  collagen  backbone  carbonyl  oxygens  and  amide 
nitrogens.  The  collagen  orientation  was  then  refined  to  optimize  the 
hydrogen  bonding  geometry.  This  procedure  allowed  all  four  I-domain  side 
chains  to  make  reasonable  hydrogen  bonds  to  the  collagen.  This  model 
predicts  that  the  side  chain  of  Tyr285,  which  projects  from  the  C  helix  into 
the  groove,  makes  unavoidable  contact  with  the  collagen,  and  that  further 
hydrogen  bonds  can  be  made  between  the  Tyr  hydroxyl  and  the  collagen 
main  chain.  This  revised  model,  which  is  rotated  about  30  degrees  from 
the  previously  published  model,  allows  the  arginine  from  the  GER  motif  of 
the  preceding  strand  to  make  a  salt  bridge  to  E256.  The  previous  model 
would  not  allow  enough  space  for  the  arginine  side  chain  without  imposing 
unfavorable  side  chain  torsion  angles.  Since  mutation  of  E256  does  not 
significantly  block  collagen  binding  in  the  present  study,  this  salt  bridge 
might  not  be  energetically  important.  The  triple  helical  character  of  a 
symmetric  collagen  trimer  generates  three  chemically  distinct  strands, 
which  we  call  the  leading,  middle,  and  lagging  strands.  Attaching  either  the 

leading  or  middle  strand  glutamate  to  the  Mg^+  ion  leads  to  the  same 
conclusions.  Attaching  the  trailing  strand  makes  a  difference  because  there 
is  no  arginine  from  the  preceding  strand  to  form  a  salt-bridge  to  E256.  The 
alternative  orientation  with  the  collagen  rotated  by  180  degrees  is  much 
less  favorable  because  the  arginine  of  the  GER  motif  would  clash  sterically 
with  the  I-domain.  As  pointed  out  in  our  previous  model,  an  aspartic  acid 
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side  chain  in  place  of  the  GER  glutamate  would  be  too  short  to  reach  the 

Mg2+  ion  without  creating  a  large  number  of  steric  clashes.  Since  N154, 
Y157,  H258  are  conserved  in  other  collagen-binding  integrin  I-domains 
(al  and  a  10),  it  is  reasonable  to  assume  that  these  residues  are  also 
involved  in  collagen  binding  in  these  integrins  (Fig.  4).  This  model  is 
consistent  with  the  observation  that  tyrosine  and  arginine  are  enriched  in 
hot  spots  of  binding  energy  in  the  protein-protein  interface  (33). 

In  the  present  model,  the  highly  conserved  D151  and  D254  residues 

in  the  a2  I-domain  are  buried  underneath  the  Mg2+  ion  and  can  not 
contact  collagen  directly.  We  have  reported  that  mutating  these  residues 
only  partially  affects  collagen  binding  to  the  recombinant  a2  I-domain 
fragment  (13,15).  Also,  Bienkowska  et  al.,  reported  that  mutating  the 
corresponding  residues  in  the  recombinant  fragment  of  the  vWf  A3 
domain  does  not  affect  collagen  binding  (23).  However,  the  same  mutation 
in  the  whole  a2  molecule  completely  blocks  collagen  binding  to  a2fJl 
(13,15).  It  is  possible  that  cation  coordination  through  these  residues  is 
critical  for  ligand  binding  in  the  I-domain  of  the  integrin  molecule,  but  not 
in  similar  domains  in  non-integrin  structures  (e.g.,  vWf).  Consistently,  the 
cation-binding  site  is  not  present  in  the  vWf  A3  domain  (23,24).  Further 
studies  would  be  needed  to  clarify  the  role  of  the  conserved  Asp  residues 
in  integrin  I-domains.  Several  other  collagen-binding  sites  have  been 
reported.  A  collagen  binding  surface  on  osteonectin  has  been  mapped  by 
mutagenesis:  it  consists  of  a  flat  surface  15  A  in  diameter  containing  polar 

and  apolar  residues  (34),  a  crucial  arginine  residue,  and  intriguingly  a  Ca^+ 
binding  site  that  might  be  directly  involved  in  collagen  binding.  Docking 
and  mutagenesis  of  Staphylococcus  aureus  adhesin  identified  another 
crucial  arginine  and  a  narrow  groove  as  the  binding  site  for  collagen  (35). 
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The  collagen  binding  surface  on  vWF-A3  has  not  been  mapped  but  the 
structurally  analogous  surface  lacks  the  charged  residues  found  in  the 
integrin  I-domains  (23,24).  Further  studies  will  be  required  to  determine 
whether  there  are  any  common  collagen  binding  mechanisms. 

While  this  paper  was  under  review,  the  crystal  structure  of  a  GER- 
containing  collagen  peptide/a2  I  domain  complex  has  been  solved  (Emsley, 
J.,  Knight,  C.G.,  Barnes,  MJ.,  Famdale,  R.W  and  Liddington,  R.,  unpublished 
results).  Preliminary  analysis  reveals  that  the  structure  is  very  similar  to 
the  revised  model  described  here.  Thus,  the  orientation  and  location  of  the 
collagen  is  as  predicted,  with  a  Glu  residues  from  the  collagen  coordinating 
directly  to  the  metal  ion.  In  addition,  there  is  an  unexpected  change  in  the 
C  helix  so  that  it  no  longer  touches  the  collagen,  in  agreement  with  the 
mutagenesis  results. 
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FIGURE  LEGEND 

Fig.  1:  Residues/loops  chosen  for  mutagenesis  in  this  study. 

The  I-domains  from  integrin  al,  a2,  alO,  aL,  aM,  aX,  aD,  and  aE  subunits 
and  the  vWf  Al  and  A3  domains  were  aligned.  The  p  strands  and  a-helices 
of  the  a2  I-domain  are  underlined.  Swapped  regions  of  a2  and  aL  (PA-al: 
152-157;  a3-a4:  213-219;  pD-a5:  257-262)  are  outlined  in  the  box.  The 
deleted  region  in  a2  (aC  del)  is  also  outlined. 

Fig.  2.  Effects  of  point  mutations  on  collagen  binding 

Cells  stably  expressing  wt  or  mutant  a 2  were  used  to  determine  adhesion 
to  collagen  (at  a  10  or  2  pg/ml  coating  concentration).  Data  are  presented 
as  %  bound  cells  to  collagen  per  %  human  a2  positive  cells  to  normalize  a2 
expression.  Typically  40-60%  of  cells  are  positive  after  selection  with  G- 
418.  Previously  published  function-negative  mutations  (D151A,  T221A, 
and  D254A)  are  included  as  negative  controls.  These  results  suggest  that 
several  relatively  conserved  residues  in  the  PA-al,  a3-a4,  and  PD-a5  loops 
are  critical  for  collagen  binding. 

Fig.  3:  Effects  of  swapping/deletion  mutations  on  collagen  binding. 
a)  Clonal  CHO  cells  stably  expressing  wild  type  or  mutant  a  2  were 
incubated  in  the  well  coated  with  collagen  type  I,  or  bovine  serum  albumin 
(negative  control).  After  incubation  at  37  C  for  1  h,  non-adherent  cells 
were  removed  and  bound  cells  were  determined  by  assaying  endogenous 
phosphatase.  Under  the  conditions  used  more  than  80%  of  cells  adhered  to 
fibronectin  as  a  positive  control.  MFI,  mean  fluorescence  intensity. 
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b)  Adhesion  to  collagen  of  wild  type  and  the  aC  deletion  mutant  a2  was 
determined  as  s  function  of  collagen  coating  concentrations.  The  data 
suggest  that  the  adhesive  function  of  the  aC  deletion  mutant  is  comparable 
to  that  of  wild  type. 

Fig.  4.:  A  revised  docking  model  of  the  «2  I-domain  and  collagen. 

All-atom  representation  of  the  a2  I-domain,  viewed  looking  down  onto  the 
MIDAS  face.  In  the  top  panel,  residues  with  mutations  that  reduce  collagen 
binding  are  in  red  (surface-exposed)  or  pink  (likely  to  disrupt  MIDAS). 
Residues  with  mutations  that  have  no  effect  on  collagen  binding  are  in 

cyan.  The  Mg^+  ion  is  shown  as  a  gray  ball.  The  new  collagen  model  is 
shown  as  a  colored  triple  helical  coil  (blue,  green  and  yellow)  drawn 
through  the  Ca  positions.  The  previous  model  (20)  is  shown  as  a 
transparent  triple  helical  coil.  Certain  residues  referred  to  in  the  text  are 
labeled.  In  the  bottom  panel,  residues  shown  in  red  are  invariant  between 
al,  a2  and  a  10  I-domains. 
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a2-I  140  CPSL-IDWWCDfeSNSIYPl — VTOAVKNFLEKFVQGLDIGPTKTQVGLIQYANNP 
aL-I  125  CIKGNVDLVFLFDGSMSLOPPEFQKILDFMKDVMKKL-- SNTSYQFAAVQFSTSY 
al-I  143  CSTQ-LDIVIVLDGSNSIYP — V3DSVTAFLNDLLKRMDIGPKQTQVGIVQYGENV 
alO-I  140  CPTY-MDWIVLDGSNSIYP — WSEVQTFLRRLVGKLFIDPEQIQVGLVQYGESP 
vWFA3  1686  CSQP-LDVI LLLDGSSSFPASYFDEMKSFAKAFI SKANIGPRLTQV SVLQYGSIT 
vWFAl  1272  CSRL-LDLVFLLDGSSRLSEAEFEVLKAFWDMMERLRISQKWVRVAWEYHDGS 
aE-I  180  GTEIAIILDGSGSIDPPDFQRAKDFISNMMRNFYEKCFECNFALVQYGGVI 
aM-I  128  CPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQL — KKSKTLFSLMQYSEEF 
aX-I  126  CPRQEQDIVFLIDGSGSISSRNFATMMNFVRAVISQF — QRPSTQFSLMQFSNNF 
aD-I  127  CPHQEMDIVFLIDGSGSIDQNDFNQMKGFVQAVMGQF — EGTDTLF ALMQY SNLL 


a2-I  192  RWFNLNTYKTKEEMIVATS  2TSQYGGD 1TNTFGAIQYARKYAYS  AASGGRRS  -  A 
aL-I  178  KTRFnF.qnyWRTCDPDAT.LKHVKHMLL-LTMTFGAINYVATEVFREELGARPD-A 
al-I  195  THEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRG-V 
alO-I  192  VHEWSLGDFRTKEEWRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPE-A 
VWFA3  1740  T IDVPWNWPEKAHLL  SLVDVMQREGG-  PSQIGDALGFAVRYLT  S  EMHGARPG  -  A 

vWFAl  1326  HAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKYTLFQIF - SKIDRPEA 

aE-I  232  QTEFDLRDFQDVMASLARVQNITQVGS-VTKTASAMQHVLDSIFTSSHGSRRK-A 
aM- I  181  RIHFTFKEFQNNPNPRSLVKPITQLLG-RTHTATGIRKWRELFNI TNGARKN-A 
aX-I  179  QTHFTFEEFRRTSNPLSLIiASVHQLQG-FTYTATAIQNWHRLFHASYGARED-A 
aD-I  180  KIHFTFTQFRTSPSQQSLVDPIVQLKG-I<TFTATGILTWTQLFHHKNGARKS-A 
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